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Abstract 


A search for neutral Higgs bosons decaying into a bb quark pair and produced in 
association with at least one additional b quark is presented. This signature is sen¬ 
sitive to the Higgs sector of the minimal supersymmetric standard model (MSSM) 
with large values of the parameter tan fi. The analysis is based on data from proton- 
proton collisions at a center-of-mass energy of 8 TeV collected with the CMS detector 
at the LHC, corresponding to an integrated luminosity of 19.7fb^^. The results are 
combined with a previous analysis based on 7 TeV data. No signal is observed. Strin¬ 
gent upper limits on the cross section times branching fraction are derived for Higgs 
bosons with masses up to 900 GeV, and the results are interpreted within different 
MSSM benchmark scenarios, light-stau and light-stop. Observed 

95% confidence level upper limits on tan j6, ranging from 14 to 50, are obtained in the 
j^mod+ benchmark scenario. 
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1 Introduction 

The discovery of a Higgs boson with a mass around 125 GeV |[T]-l3| marked a milestone for el¬ 
ementary particle physics. While the measured properties of the observed boson are in agree¬ 
ment with the expectations of the standard model (SM) with the current experimental precision, 
this particle could well be the first visible member of an extended Higgs sector, which would be 
a direct indication of new physics. Extended Higgs sectors are possible in various theoretical 
models, such as Supersymmetry lH-lZl, which relates fermionic and bosonic degrees of freedom 
and in consequence requires the introduction of additional Higgs bosons as well as a super¬ 
partner to each SM particle. The superpartners provide potential dark-matter candidates |8J, 
and their contribution to quantum-loop corrections can lead to a unification of the gauge cou¬ 
plings at higher energies [91 ■ Moreover, the problem of the quadratic divergence of the Higgs 
boson mass at high energies IfTOl is solved naturally through cancellation of loop terms by the 
superpartners. 

The minimal supersymmetric extension of the SM (MSSM) ['5 | contains two scalar Higgs dou¬ 
blets, which result in two charged Higgs bosons, H^, and three neutral ones, jointly denoted 
as (p. Among the latter are two CP-even (h, H) and one CP-odd state (A). The recently discov¬ 
ered boson with a mass near 125 GeV might then be interpreted as one of the neutral CP-even 
states. Two parameters, generally chosen as the mass of the pseudoscalar Higgs boson wj a and 
the ratio of the vacuum expectation values of the two Higgs doublets, tan/I = Vi/ Vi, define 
the properties of the Higgs sector in the MSSM at tree level. For tan /3 values larger than one, 
the couplings of the Higgs field to down-type fermions are enhanced relative to those to the 
up-type fermions. Furthermore, the A boson is nearly degenerate in mass with either the h or 
H boson. These effects enhance the combined cross section for producing these Higgs bosons 
in association with b quarks by a factor of ~2 tan^ fi. The decay ^ > bb is expected to have a 

high branching fraction (^90%), even at large values of the Higgs boson mass lITTI . 

Measurements at the CERN FHC in the ^ > tt decay mode IIT214T5I have lead to the most 

stringent constraints on tan (5 so far, with exclusion limits in the range 4-60 in the mass inter¬ 
val of 90-1000 GeV. Preceding limits had been obtained by the FEP 11^ and Tevatron experi¬ 
ments [TtHI^ . Also the (p ^ fifi decay mode has been investigated Ill^l20l . Besides extending 
the MSSM Higgs boson search to an independent channel, the ^ > bb decay mode is particu¬ 
larly sensitive to the higgsino mass parameter fi KITl . and thus to the bottom quark Yukawa cou¬ 
pling. In the ^ > TT channel, the sensitivity to ji is much smaller due to a partial cancellation 
of the respective radiative corrections between the contributions to the production and decay 
processes lETI . Beyond the MSSM interpretation, lepton-specific two-Higgs-doublet models 
(2HDM) [[22l may allow for enhanced couplings of down-type quarks relative to leptons. The 
bb decay mode is also relevant in the more general context of exotic resonance searches, moti¬ 
vated for example by dark-matter models involving mediator particles with a large coupling 
to b quarks [2311^ . 

Searches in the ^ > bb decay mode have initially been performed at FEP [T^ and by the CDF 
and DO experiments [251 at the Tevatron collider. The first and so far the only analysis at the 
FHC in this channel has been performed by the CMS experiment, using the 7 TeV data, and set 
significantly more stringent bounds in the mass range 90-350 GeV [2^ . 

In this article, the CMS search is extended by adding the data set comprising 19.7 fb^^ of 
proton-proton collision data, collected at a center-of-mass energy of 8 TeV, and by the use of 
a refined methodology. The higher integrated luminosity as well as the greater center-of-mass 
energy allow extension of the search up to a mass of 900 GeV. 
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3 Event reconstruction and simulation 


The search is performed for neutral MSSM Higgs bosons (p with masses > 100 GeV that 
are produced in association with at least one b quark and decay to bb; an illustration of the 
signal process is given by the diagrams in Fig. The signal is thus searched for in final states 
characterized by at least three b-tagged jets. No requirement of a fourth b-tagged jet is made, 
since its kinematic distributions extend significantly beyond the available acceptance, and the 
resulting signal efficiency would be very low. Events are selected by specialized triggers that 
identify b jets already at the online level. This is important to suppress the large rate of multijet 
production at the LHC. The analysis searches for a peak in the invariant mass distribution of 
the two b jets with the highest pj values, which are assumed to originate from the Higgs boson 
decay. The dominant background is the production of heavy-flavor multijet events containing 
either three b jets, or two b jets plus a third jet originating from either a charm quark, a light- 
flavor quark or a gluon, which is misidentified as a b quark jet. For the final limits, the results 
of the 8 TeV analysis are combined with the previous 7TeV analysis Il26l . 



b b 
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Figure 1: Example Feynman diagrams of the signal processes. 


2 The CMS detector 

The central feature of the CMS detector is a superconducting solenoid of 6 m internal diame¬ 
ter, providing a magnetic field of 3.8 T. Within the field volume, the inner tracker is formed 
by a silicon pixel and strip tracker. It measures charged particles within the pseudorapidity 
range \r]\ < 2.5. The tracker provides a transverse impact parameter resolution of approxi¬ 
mately 15 pm and a resolution on pj of about 1.5% for 100 GeV particles. Also inside the field 
volume are a crystal electromagnetic calorimeter, and a brass and scintillator hadron calorime¬ 
ter. Muons are measured in gas-ionization detectors embedded in the steel flux-return yoke, in 
the pseudorapidity range \rj\ < 2.4, with detector planes made using three technologies: drift 
tubes, cathode strip chambers, and resistive-plate chambers. Matching muons to tracks mea¬ 
sured in the silicon tracker results in a px resolution between 1% and 5%, for pj values up to 
1 TeV. Forward calorimetry extends the coverage provided by the barrel and endcap detectors 
up to I p I <5. A detailed description of the CMS detector, together with a definition of the 
coordinate system used and the relevant kinematic variables, can be found in Ref. Il27l . 

3 Event reconstruction and simulation 

A particle-flow algorithm Il28l UM is used to reconstruct and identify all particles in the event, 
i.e. electrons, muons, photons, charged hadrons, and neutral hadrons, with an optimal combi¬ 
nation of all CMS detectors systems. 

The reconstructed primary vertex with the largest p|-sum of its associated tracks is chosen as 
the vertex of the hard interaction and used as reference for the other physics objects. 

Jets are clustered from the reconstructed particle candidates using the anti-kj algorithm 13011 
with a distance parameter of R = 0.5, and each jet is required to pass dedicated quality cri- 
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teria to suppress the impact of instrumental noise and misreconstruction. Contributions from 
additional proton-proton interactions within the same bunch crossing (pileup) affect the jet mo¬ 
mentum measurement. To mitigate this effect, charged particles associated with other vertices 
than the reference primary vertex are discarded in the jet reconstruction, and residual contri¬ 
butions (e.g. from neutral particles) are accounted for using a jet-area based correction KSTI . Jets 
originating entirely from pileup interactions are identified and rejected based on vertex and jet- 
shape information Il32]| . Jet energy corrections are derived from simulation, and are confirmed 
with in situ measurements of the energy balance in dijet and Z/ 7 +jet events Il3^ . 

For the offline identification of b jets, the combined secondary vertex (CSV) algorithm 1341 is 
used. This algorithm combines information on track impact parameters and secondary vertices 
within a jet in a single likelihood discriminant that provides a good separation between b jets 
and jets of other flavors. Secondary-vertex reconstruction is performed with an inclusive vertex 
search amongst the tracks associated with a jet 1351 . 

Simulated samples of signal and background events, also referred to as Monte Carlo (MC) 
samples, were produced using the pythia 113^ and MadGraph 11371 event generators and 
include pileup events. The response of the CMS detector is modeled with Geant4 1381 . The 
MSSM Higgs signal samples, pp — > bb^+X with > bb, were produced at leading order 
in the 4-flavor scheme with PYTHIA version 6.4.12. The px and p distributions of the leading 
associated b jet are in good agreement with the next-to-leading order (NLO) calculations II39I . 
The multijet background from quantum chromodynamics (QGD) processes has been produced 
with PYTHIA, while for tf +jets events the MadGraph event generator was used in its version 
5.1.5.11. For all generators, fragmentation, hadronization, and the underlying event have been 
modeled using pythia with tune Z2*. The most recent PYTHIA 6 Z2* tune is derived from 
the Z1 tune Il40l , which uses the GTEQ5L parton distribution functions (PDF) set, whereas Z2* 
adopts the GTEQ 6 L BTl PDF set. 

4 Trigger and event selection 

A major challenge to this analysis is posed by the huge hadronic interaction rate at the LHG, 
and it is addressed with a dedicated trigger scheme, designed especially to suppress the QGD 
multijet background. Only events with at least two jets in the pseudorapidity range of |j/| < 
1.74 are selected. The leading jet (here and in the following the jets are ordered by decreasing 
px) is required to have px > 80 GeV, while the subleading jet must have px > 70 GeV. Fur¬ 
thermore, the event is only accepted if the absolute value of the difference in pseudorapidity 
between any two jets fulfilling the px and p requirements is less than or equal to 1.74. The tight 
online requirements on the angular variables of the jets are introduced to reduce the trigger 
rates while preserving the signal significances in the probed mass range of the Higgs bosons. 
At the trigger level, b jets are identified using an algorithm that requires at least two tracks with 
high 3D impact parameter significance to be associated with the jet. At least two jets within the 
event must meet the online b tagging criteria to be accepted by the trigger. The efficiency of 
the jet-px requirements in the trigger are derived from the data with zero-bias triggered events. 
The online b tagging efficiencies relative to the offline b tagging selection are obtained from 
simulations of QGD events generated with pythia and scaled to account for the different b 
tagging efficiencies between data and simulation. The total trigger efficiency for events satisfy¬ 
ing the offline selection requirements detailed below ranges from 46-62% over the Higgs boson 
mass range of 100-900 GeV. 

The offline selection requires events to have the two leading jets within |p| < 1.65 to be fully 
within the pseudorapidity windows of the trigger, and the third leading jet within |p| < 2 . 2 . 
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4 Trigger and event selection 


The three leading jets must also pass pj thresholds of 80, 70 and 20 GeV, respectively. In ad¬ 
dition, the two leading jets must have a pseudorapidity difference of |Api 2 | < 1.4, because 
the QCD multijet background increases significantly with respect to the expected signal with 
increasing |Api 2 |- A minimal pairwise separation of AR > 1 between the three leading jets, 
where AR = \/ (Ap)^ -|- {Acp)^ and Arj and A(p are the pseudorapidity and azimuthal angle dif¬ 
ferences (in radians) between the two jets, is imposed to suppress background from b quark 
pairs arising from gluon splitting. 

In the following, "triple-b-tag" and "double-b-tag" samples are introduced, which play crucial 
roles in the analysis. The triple-b-tag sample is the basis for the signal search. It is defined 
by requiring all three leading jets to satisfy a tight CSV b tagging selection requirement at a 
working point characterized by a misidentification probability for light-flavor jets (attributed 
to u, d, s, or g partons) of about 0.1% at an average jet px of 80 GeV. The typical corresponding 
efficiency for b jets is about 50-60% in the central pseudorapidity region. The total number of 
events passing the trigger and offline selections is approximately 69 k. 

The double-b-tag sample plays a key role in the estimation of the multijet background. In this 
selection, only two of the three leading jets must pass the tight CSV b tagging requirement. 
The total number of double-b-tag events remaining after the trigger and offline selections is 
about 2.4 M. While this definition does not explicitly exclude the triple-b-tag events, the po¬ 
tential signal contribution is negligible due to the size of the QCD multijet background in the 
double-b-tag sample, and a veto would lead to distortions in the background model described 
in Section 13 

An additional flavor-sensitive quantity, the secondary vertex mass sum of a jet, EMsv,y, is in¬ 
troduced to further improve the separation between jets of different flavor on top of the CSV b 
tagging requirement. It is defined as the sum of the invariant masses calculated from the tracks 
forming secondary vertices inside a jet, and thus provides additional separation power. The 
extension of the signal mass range compared to the previous 7TeV analysis implies that the 
jets can have larger px, with the consequence that b-tagged jets from background events have 
a higher probability to contain two heavy flavor quarks instead of at most one. This can occur 
for example if a very energetic gluon splits into a pair of b or c quarks with a narrow opening 
angle. For this reason, b and c quark pairs merged into the same jet, labeled as "b2" and "c2", 
respectively, are treated separately from the cases of unmerged b and c quarks, labeled "hi" 
and "cl", respectively. 

The subsequent analysis will use the secondary vertex mass information to categorize events 
and to build background templates. Therefore, the secondary vertex mass sums of the three 
leading jets are combined into a condensed event b tagging estimator, X 123 . The construction 
of this estimator is shown in Table Each selected jet j, where j is the rank of the jet in order of 
decreasing px, is assigned an index Bj, which can take one of the four possible integer values 
from 0-3 according to its secondary vertex mass sum value, as shown in Table(left). For jets 
with no reconstructed secondary vertex, Bj is also set to zero. The definition of these index 
regions is motivated by the population of the secondary vertex mass sum by the different jet 
flavors. From the three indices Bi, B 2 , and B 3 , a combined event b tagging variable X 123 is 
constructed as shown in Table (right). By definition, the event b tagging variable X 123 can 
assume nine possible values ranging from 0 to 8 . The events are then categorized according 
to the value of X 123 , with the rationale of having sufficient statistics in each bin. The signal is 
searched for in the two-dimensional spectrum formed by the invariant mass of the two leading 
jets, Mi 2 , and the event b tagging variable X 123 . 
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Table 1: Left: Definition of the index Bj according to the value of the secondary vertex mass 
sum of the jet. Right: Definition of the values of the combined event b tagging estimator X 123 
for all combinations of the secondary vertex mass sum indices Bi, B 2 , and B 3 . 


ZMsv,i [GeV] 


B 3 

Bl -|- B 2 

0-1 

0 

0-1 2-3 4-6 

1-2 

1 

0-1 

0 1 2 

2-3 

2 

2 

3 4 5 

>3 

3 

3 

6 7 8 


5 Background model 

The main background for this analysis originates from QCD multijet production, with at least 
two energetic jets actually containing b hadrons, and a third jet that passes the b tagging se¬ 
lection but possibly as a result of a mistag. Since this type of background cannot be accurately 
predicted by MC simulation, it is estimated from the data using control samples. The chosen 
method is similar to the one used in Ref. |!42j. The background is modeled by a combination 
of templates, which are constructed from the double-b-tag sample. Only the shape of these 
background templates is relevant, since the normalization will be determined by the fit to the 
data. 

Three categories of events are distinguished in the double-b-tag sample, which are denoted 
as xbb, bxb and bbx depending on whether the jet with the highest, second-highest or third- 
highest Pt is exempt from the b-tag requirement. In this notation the three jets are referred to 
in order of decreasing pj. 

From these three double-b-tag categories, background templates are constructed by weighting 
each untagged jet with the b tagging probability according to its assumed flavor. In the template 
nomenclature, the convention is to indicate the assumed flavor with a capital letter, and it can 
be one of the five options Q, Cl, Bl, C2, and B2, where Q refers to light quarks or gluons, while 
Cl and C2 refer to a jet with a single charm quark and a pair of charm quarks, respectively. 
Similarly, Bl and B2 refer to jets assumed to contain a single bottom quark and a pair of bottom 
quarks, respectively. The total number of templates is therefore 15. Each background template 
is a binned distribution in the two-dimensional space spanned by M 12 , the dijet mass of the two 
leading jets, and the event b tagging variable X 123 . For the construction of each template, each 
event is weighted with the b tagging probability corresponding to the assumed flavor of the 
untagged jet. This weight accounts for the effect of the b tagging discriminant threshold. The 
b tagging probability for each flavor is determined with simulated QCD multijet events, where 
the flavor selection is based on Monte Carlo truth information. Data/MC scale factors for the 
b tagging efficiencies are applied where appropriate Il34l . Since the b tagging efficiency has a 
characteristic dependence on pj and rj for each flavor, the weighting results in different shapes 
of the Mi 2 distributions. The X 123 dimension of the templates is modeled in the following way: 
In a given M 12 bin, an event can contribute to different X 123 bins depending on the flavor of its 
jets and its kinematics. For the two b-tagged jets, the secondary vertex mass sum information 
is taken as measured. For the untagged jet, each of the four possible values of the secondary 
vertex mass sum index is taken into account with a weight according to the probability that a 
jet with given flavor, pj and f] will assume this value. These probabilities, parametrized as a 
function of the jet px and rj, were determined from simulated events, and validated in control 
data samples. 

Two additional corrections are applied to the templates. The first correction addresses a con¬ 
tamination in the double-b-tag sample from non-bb events at the level of a few percent. This 
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5 Background model 


CMS, 19.7fb '(8TeV) CMS, 19.7 fb' (8 TeV) 




Figure 2: Projections of M 12 (left) and X 123 (right) for the five background templates used in the 
fit. The vertical scale is shown in arbitrary units. 

contamination is estimated directly from the data using a negative b tagging discriminator IMl 
constructed with a track counting algorithm based on the negative impact parameter of the 
tracks, ordered from the most negative impact parameter significance upward. A second cor¬ 
rection is required since the online b tagging patterns are different in the double- and the triple- 
b-tag samples. In double-b-tag events, the two online b tags usually coincide with the offline 
b tags, while in triple-b-tag events the online b tags can be assigned to any two-jet subset of 
the three leading jets. The correction is computed from simulated QCD multijet events, and is 
applied in the form of additional weights to the events in the double-b-tag sample. 

Similarity in shape between some templates leads to unnecessary redundancy. For this reason, 
similar templates are combined using a -based metric to guide the decisions. The relative 
weights in a combination are taken from MC. In the cases where one of the two leading jets is 
untagged, and the flavor assumption is the same, e.g. Qbb, and bQb, the templates are com¬ 
bined, resulting in a merged template (Q,b)b = Qbb + bQb. By analogy, also (Cl,b)b, (Bl,b)b, 
(C2,b)b, and (B2,b)b are obtained. The resulting set of ten templates still shows many similar¬ 
ities. For this reason, (Bl,b)b, (B2,b)b, and (C2,b)b are combined into a single template; bbBl, 
bbB2, and bbC2 into a second; and bbCl and bbQ into a third. The total number of templates 
to be fitted in combination to the data is thus reduced to five, namely (B2+Bl+C2,b)b, (Cl,b)b, 
(Q,b)b, bb(B2+Bl+C2), and bb(Cl+Q). The projections of the M 12 and X 123 variables are shown 
in Fig.|^for these five background templates. 

Beyond QCD multijet production, top-quark pair (tf) events pose the largest potential back¬ 
ground to the signal topology. The requirement of three b-tagged jets reduces this background 
substantially, since only two highly energetic b-tagged jets are expected from the decays of 
the top quarks. However, one of the W bosons can decay into a cs pair, and the c jet can be 
mistagged as b jet. Using the tt Monte Carlo sample, the tf contribution is found to be rel¬ 
atively small; the number of tf events passing the selections of the double- and triple-b-tag 
datasets it estimated to be about a factor of 70 smaller than the total amount of data in these 
samples. The invariant mass spectrum from tf is very similar to the one from the QCD mul¬ 
tijet background, and does not show any narrow peaks. Since the tf events contribute to the 
double-b-tag sample, they are also taken into account in the background model. 
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6 Signal modeling 

6.1 Signal templates 

A signal template is obtained for each MSSM Higgs boson mass considered by applying the 
full selection to the corresponding simulated signal data set, for nominal masses in the range 
of 100-900 GeV. The sensitivity of this analysis does not extend down to cross sections as low 
as that of the SM Higgs boson. Thus, a signal model with a single mass peak is sufficient, in 
contrast to the ^ > tt analysis fl4L where the signal model comprises the three neutral Higgs 
bosons of the MSSM, one of which is SM-like. The projections for the M 12 and X 123 distributions 

CMS Simulation (8 TeV) CMS Simulation (8 TeV) 




Figure 3: The M 12 (left) and X 123 (right) projections of the signal templates for Higgs boson 
masses of = 200, 350 and 500 GeV. The vertical scale is shown in arbitrary units. 

of the signal templates for three different Higgs boson masses are shown in Fig. The shape 
of the mass distribution is dominated by the experimental resolution and the combinatorial 
background. The natural width expected for a MSSM Higgs boson in the considered mass and 
tan j 6 region is negligible in comparison with the detector resolution. At a mass of 500 GeV and 
tan j 6 = 50, for example, the natural width of the mass peak is found to be 13 GeV, which is 
only %14% of the RMS of the reconstructed mass distribution. The X 123 distributions show 
little variation with the MSSM Higgs boson mass; they reflect the triple-b-quark signature. 

6.2 Signal efficiency 

The signal efficiency for each MSSM Higgs mass point is obtained from the simulated data 
sets. The efficiency of the kinematic trigger selection has been derived with data from control 
triggers and is applied by weighting. Scale factors to account for the different b tagging ef¬ 
ficiencies in data and MG ||34l are also applied. The efficiency ranges between 0.17 and 6.38 
per mille and peaks around 300 GeV. The detailed mass dependence is shown in [Appendix A| 
The decrease of the efficiency for masses beyond 300 GeV is due to the degradation of the b 
tagging efficiency at high jet px- For masses around 300 GeV the kinematic selections give rise 
to an efficiency of approximately 0.12, which is reduced to approximately 0.0065 when triple b 
tagging is required. 
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7 Systematic uncertainties 


6.3 Fitting procedure 

The overall two-dimensional distribution in the variables M 12 and X 123 is fitted by a model 
combining the background templates and optionally a signal template. A binned likelihood 
technique is used. The relative contribution of each template is determined by the fit. The sys¬ 
tematic uncertainties are represented by nuisance parameters that are varied in the fit according 
to their probability density functions. 


7 Systematic uncertainties 

The following systematic uncertainties in the expected signal and background estimates affect 
the determination of the signal yield and/or its interpretation within the MSSM. 

Uncertainties in the yields of the signal contributions include the uncertainty in the luminosity 
estimate ||4^ , the statistical uncertainties in the signal MC samples, and the uncertainties of 
the relative online b tagging corrections. Also taken into account are the QCD renormalization 
and factorization scale (}ir, Hf) uncertainties, the uncertainties due to the parton distribution 
functions (PDF) and the strong coupling constant ocs, and the uncertainties in the underlying 
event and parton shower modeling, which all only affect the translation of the signal cross 
section into tan j5 in the MSSM interpretation. The impact of these uncertainties on the signal 
acceptance is not significant. 

The rate as well as the shape of the signal contributions are also affected by the uncertainties in 
the trigger efficiencies, the jet energy scale, the jet energy resolution, and the pileup modeling, 
as well as the scale factors for the b-tag efficiency, the mistag rate, and the secondary vertex 
mass scale. The last three also affect the shapes of the background templates (recall that only 
the shape is relevant for the background templates). The statistical uncertainty in the template 
shape, due to the limited size of the double-b-tag sample and due to the uncertainty in the 
offline b-tag efficiencies and mistag rates, are propagated into the templates and accounted for 
in the fitting procedure. Additional systematic uncertainties in the shapes of the background- 
templates arise from the impurity of the double-b-tag sample and the online b tag correction 
to the templates. The sources and types of systematic uncertainties and their impact on the 
expected limit are summarized in Table 

Table 2: Systematic uncertainties and their relative impact on the expected limit. The values 
represent an average over the mass range from 100-900 GeV, except for the template statistical 
and the offline b tagging (be) uncertainties, where ranges are given. 


Source 

Type 

Target 

Impact 

Online b tagging 

Rate 

Signal 

11 % 

Integrated luminosity 

Rate 

Signal 

0 . 1 % 

Jet trigger 

Rate + Shape 

Signal 

0 . 1 % 

Jet energy scale 

Rate + Shape 

Signal 

0.5% 

Jet energy resolution 

Rate + Shape 

Signal 

0 . 1 % 

Offline b tagging (be) 

Rate + Shape 

Signal + Background 

2-16% 

Offline b tagging (udsg) 

Shape 

Background 

0 . 2 % 

Template stat. uncertainty 

Shape 

Background 

1 - 21 % 

Secondary vertex mass sum 

Shape 

Signal + Background 

0.9% 

bb purity correction 

Shape 

Background 

3.4% 

Online b tagging correction 

Shape 

Background 

0.5% 
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8 Results 


8.1 Background-only fit 

In the first step, an unconstrained fit is performed without inclusion of a signal template, in¬ 
volving a linear combination of the background templates only. Results are shown in Fig.|^and 
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Figure 4: Projections of the dijet mass M 12 (left) and event b-tag variable X 123 (right) in the 
triple-b-tag sample, together with the corresponding projections of the fitted background tem¬ 
plates. The hatched area shows the total bin-by-bin background uncertainty of the templates 
prior to the fit, which takes into account the limited size of the double-b-tag sample and the 
uncertainties of the offline b-tag efficiencies and mistag rates. For illustration, the signal contri¬ 
bution expected in the benchmark scenario of the MSSM with niA = 350 GeV, tan /3 = 30, 
and j4 = -I-200 GeV is overlayed, scaled by a factor 10 for better readability. In addition, the 
ratio of data to the background estimate is shown at the bottom. 


Table The template-based background model describes the data well within the uncertainty 
of the template fits with a goodness-of-fit of x^/^dot = 207.9/209, where N^of is the number 
of degrees of freedom, corresponding to a p-value of 0.51. As expected, the fit is dominated by 
templates involving triple b-jet signatures, whose fitted total contributions amount to ?a82%. 


8.2 Combined fit of signal and background templates 

In the second step, a signal template is included together with the background templates in the 
fit, with the relative fractions of signal and background templates allowed to vary freely. The fit 
is performed for all considered Higgs boson masses from 100 to 900 GeV. None of the fits shows 
any significant signal excess. Results for a Higgs boson mass of 350 GeV are shown in Fig. 
and Table At this mass point, the highest fluctuation in the fitted Higgs boson production 
cross section is observed, corresponding to a local significance of approximately 1.5 standard 
deviations. The goodness-of-fit is X^/^doi = 205.2/208, corresponding to a p-value of 0.54. 


8.3 Upper limits on cross sections times branching fractions 

Gross sections are obtained from the fractions determined by the fit multiplied by the total 
number of data events after the selection in the signal region, and divided by the corresponding 
signal efficiencies (Section|6.2[) and the integrated luminosity of 19.7fb^^. 
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Figure 5: Results from the combined fit of signal and background templates in the triple-b- 
tag sample, at the 350 GeV mass point. The left plot shows the projections of the dijet mass 
Mi 2 , the right plot the projections of the event b-tag variable X 123 . The red graph represents 
the fitted Higgs signal contribution. The hatched area shows the total bm-by-bin background 
uncertainty of the templates prior to the fit, which takes into account the limited size of the 
double-b-tag sample and the uncertainties of the offline b-tag efficiencies and mistag rates. In 
addition, the ratio of data to the background estimate is shown at the bottom. 


Table 3: Relative contributions of the individual templates as determined by the background- 
only and by the signal-rbackground fit for a Higgs boson mass hypothesis of 350 GeV. 


Template 

Background-only fit 
fraction [%] 

Signal-Pbackground fit 
fraction [%] 

(B2-pBl-pG2,b)b 

51.3 ±3.5 

49.5 ± 3.9 

(Gl,b)b 

1.3 ±2.3 

1.7±3.1 

(Qrb)b 

1.2 ± 2.0 

1.1 ±1.5 

bb(B2+Bl+G2) 

31.2 ±3.2 

32.2 ± 3.4 

bb (Gl+Q) 

15.1 ±0.9 

15.0 ±0.9 

hh(p{m = 350 GeV) 

— 

0.5 ±0.3 
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Figure 6: Expected and observed upper limits at 95% CL on C7(pp hep + X) B{(p ^ bb) as a 
function of m^j,, where cp denotes a generic neutral Higgs-like state. 


In the absence of any significant signal, the results are translated into upper limits on the cross 
section times the branching fraction, r7(pp hep + X) B{(p ^ bb), of a generic Higgs-like state 
in the mass range 100-900 GeV. For calculations of exclusion limits, the modified frequentist 
construction CL dliHli^ is adopted using the RooStats package 1461 . The chosen test statistic, 
used to determine how signal- and background-like the data are, is based on the profile likeli¬ 
hood ratio. Systematic uncertainties are incorporated in the analysis via nuisance parameters 
and treated as pseudo-observables, following the frequentist paradigm. These uncertainties 
have been listed in Section 0 


The observed and the median expected 95% confidence level (CL) limits as a function of the 
Higgs boson mass are shown in Fig.|^and listed in Table [^in |Appendix B The lu and lei bands 
of the test statistic, including systematic uncertainties, are also shown. 


8.4 Interpretation within the MSSM 

The cross section limits shown in Fig.|^are further translated into exclusion limits on the MSSM 
parameters tan jS and nij^. The cross sections obtained with the four-flavor NLO QCD calcula¬ 
tion Il47l |48l and the five-flavor NNLO QCD calculation as implemented in bbh@nnlo t49]| 
for b T-h/H/A associated production have been combined using the Santander matching 
scheme Il50l . The branching fractions were computed with the FeynHiggs ll5Tl - l54l and HDE- 
CAY [SSlISll programs as described in Ref. IHTI . 

The observed and expected 95% CL median upper limits on tan jS versus jha, together with 
the Ice and ler bands, are shown in Fig. |^(left). They have been computed within the tradi¬ 
tional MSSM benchmark scenario Il5^ with the higgsino mass parameter ^ = -|-200 GeV. 
The observed upper limits range from tan about 20 in the Iow-wa region to about 50 at 
niA = 500 GeV, and extend the existing measurement at 7 TeV Il2^ into the hitherto unexplored 
niA region beyond 350 GeV. The model interpretation is not extended to higher masses above 
500 GeV because the theoretical predictions are not reliable for tan /3 much higher than 60. 

While the cross section limits obtained from the 2011 and 2012 data cannot be combined directly 
due to the different center-of-mass energies, such a combination is possible for the model- 
dependent interpretation. The resulting upper limits on tan /3 versus rriA from both data periods 
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Figure 7: Expected and observed upper limits at 95% CL for the MSSM parameter tan versus 
Wa in the benchmark scenario with = +200 GeV. The excluded parameter space (ob¬ 
served limit) is indicated by the shaded area. Regions where the mass of neither of the CP-even 
MSSM Higgs bosons h or H is compatible with the discovered Higgs boson of 125 GeV within 
a range of 3 GeV are marked by the hatched areas. The left plot shows the result obtained with 
the 8 TeV data only, the right plot shows the result obtained after a combination with the 7 TeV 
data. For comparison, the expected limit of the 7 TeV data analysis Ii2^ is overlayed. 


are shown in Figure]^ (right). While the sensitivity is significantly enhanced compared to the 
7 TeV analysis 1126] already up to 350 GeV, the addition of the 7 TeV result visibly improves the 
sensitivity in the low-mass area below 200 GeV. The observed limit for tan ranges down to 
about 14 at the lowest ?«a value considered. 


Association of one of the CP-even MSSM Higgs bosons h and H with the measured state at a 
mass of 125 GeV within a margin of ±3 GeV that reflects the theoretical uncertainties 1(2X1 leads 
to an indirect constraint on tan j6. The incompatible regions in the parameter space are illus¬ 
trated by the hatched areas in both plots in Fig. In the scenario, the MSSM parameters 
beyond tree level have been tuned such that ni\^ becomes as large as possible. As a result, large 
?«A and already moderate values of tan /3 lead to ?«h values that are higher than the measured 
Higgs boson mass. This apparent exclusion of large tan j6 values is, however, an artificial con¬ 
sequence of the assumptions in the scenario. Recently, several new MSSM benchmark 
scenarios have been proposed, which are more naturally compatible with the observed Higgs 
boson at 125 GeV li2T] , and among them the light-stop, and light-stau scenarios 

are also used in the following for the interpretation of the results of this analysis. The observed 
and expected 95% CL exclusion limits in these scenarios with = +200 GeV, obtained with the 
combined 7 and 8 TeV data, are shown in Fig.|^ (The term "stop" refers to the supersymmetric 
partner of the top quark throughout this paper. Results for the T-phobic and low-mn scenarios 
are not shown because the analysis has sensitivity in a limited mass region only.) The limits 
obtained in all MSSM benchmark scenarios are listed in Tables [6] to [IT] in 


Appendix B 


The aforementioned sensitivity of the bb channel to the higgsino mass parameter /r is 

evident in Fig.|^ where the limit in the scenario is compared for different values of ji. The 

dependence is particularly pronounced at higher ?«Ai for example, the observed upper limit on 
tan j6 varies from 30 for ji = —500 GeV to beyond 60 for ji = +500 GeV for wja = 500 GeV. The 


limits are also listed in Table 12 in Appendix B 
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Figure 8: Expected and observed upper limits at 95% CL for the MSSM parameter tan/3 
versus wja in the light-stop, and light-stau benchmark scenarios with }i = 
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Figure 9: Expected and observed upper limits at 95% CL for the MSSM parameter tan versus 
Wa for four different values of the higgsino mass parameter ^ (left) and versus for three 
different values of ma (right) in the scenario. 


9 Summary 

A search for a Higgs boson decaying into a pair of b quarks and accompanied by at least one 

additional b quark has been performed in proton-proton collisions at a center-of-mass energy of 

_1 

8 TeV at the LHC, corresponding to an integrated luminosity of 19.7fb . The data were taken 
with dedicated triggers using all-hadronic jet signatures combined with online b tagging. A 
selection of events with three b-tagged jets has been performed in the offline analysis. A signal 
has been searched for in the two-dimensional spectrum formed by the invariant mass of the 
two leading jets and a condensed event b-tag estimator. 

No evidence for a signal is found. The observed distributions are well described by a back¬ 
ground model constructed from events in which only two of the three leading jets are required 
to be b tagged. Upper limits on the Higgs boson cross section times branching fraction are ob¬ 
tained in the mass region from 100-900 GeV, thus extending the search to considerably higher 
masses than those accessed by the previous 7 TeV analysis. The upper limits range from about 
250 pb at the lower end of the mass range, to about 1 pb at 900 GeV. 

The results are interpreted within the MSSM in the benchmark scenarios 
light-stau and light-stop, and lead to upper limits for the model parameter tan j6 as a function of 
the mass parameter niA- In combination with the 7 TeV data, the observed limit for tan /I ranges 
down to about 14 at the lowest ma value of 100 GeV in the scenario with a higgsino mass 

parameter of = -1-200 GeV. The limit depends significantly on ji, varying from tan j6 = 30 for 
}i = —500 GeV to beyond 60 for ji = -|-500 GeV at niA = 500 GeV. 
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Appendix A Signal Efficiency 

as a function of the 

Higgs boson mass. 


The signal efficiencies are summarized in Table and shown in Eig. 10 


Table 4: The total signal efficiency in per mille as a function of the Higgs boson mass for a 
center-of-mass energy of 8 TeV. 


[GeV] Efficiency [per mille] 

100 

0.17 

140 

0.57 

160 

1.03 

200 

2.85 

300 

6.38 

350 

6.32 

400 

6.08 

500 

5.07 

600 

3.85 

700 

2.90 

900 

1.39 


Appendix B Exclusion limits 

The model-independent 95% CL limits on £r(pp —^h(p + X) B{<p —> bb) are listed in Table|^for 
different Higgs boson masses m^. The 95% CL limits of (tan/I,m a) are listed in Tables]^ to [TT| 
for different MSSM benchmark scenarios with y = -|-200 GeV and for different values of y in 
the scenario in Table 
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Appendix B Exclusion limits 


CMS Simulation (8 TeV) 



[GeV] 


Figure 10: The signal efficiency as a function of the Higgs boson mass for a center-of-mass 
energy of 8 TeV. 

Table 5: Expected and observed 95% CL upper limits on cr(pp h(p + X) B{(p ^ bb) in pb as 
a function of m^, where (p denotes a generic Higgs-like state, as obtained from the 8 TeV data. 


Mass [GeV] 

-2cr 

— Icr 

Median 

+lcr 

+2(7 

Observed 

100 

160.7 

221.0 

330.4 

518.8 

811.2 

251.9 

140 

49.4 

68.0 

101.6 

161.2 

254.7 

158.8 

160 

25.9 

35.6 

52.5 

81.6 

126.1 

68.7 

200 

13.7 

19.0 

28.2 

44.1 

68.6 

17.8 

300 

3.0 

4.1 

6.1 

9.4 

14.5 

10.5 

350 

1.9 

2.7 

3.9 

6.1 

9.3 

7.1 

400 

1.3 

1.8 

2.7 

4.2 

6.4 

2.4 

500 

0.8 

1.2 

1.7 

2.7 

4.2 

1.5 

600 

0.6 

0.9 

1.3 

2.0 

3.2 

0.7 

700 

0.5 

0.7 

1.1 

1.8 

2.8 

1.3 

900 

0.4 

0.6 

1.0 

1.6 

2.7 

0.8 


Table 6: Expected and observed 95% CL upper limits on tan /3 as a function of ma in the 
ji = +200 GeV, benchmark scenario obtained from the 8 TeV data only. 


Mass [GeV] 

-2(7 

— 1(7 

Median 

+ 1(7 

+2(7 

Observed 

100 

14.4 

17.4 

22.0 

29.3 

39.4 

18.8 

140 

15.5 

18.4 

22.9 

30.1 

40.1 

29.4 

160 

13.6 

16.2 

20.2 

26.4 

34.9 

23.5 

200 

15.2 

18.1 

22.8 

29.9 

40.0 

17.7 

300 

18.0 

21.0 

25.9 

33.4 

43.9 

34.9 

350 

21.8 

25.4 

31.0 

39.7 

52.2 

42.6 

400 

25.1 

29.3 

36.0 

46.2 

— 

33.9 

500 

36.4 

42.7 

52.9 

— 

— 

49.4 
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Table 7: Expected and observed 95% CL upper limits on tan /3 as a function of OTa in the 
fi = +200 GeV, benchmark scenario obtained from a combination of the 7 and 8 TeV data. 


Mass [GeV] 

-2(7 

— 1(7 

Median 

+ 1(7 

+2(7 

Observed 

100 

13.1 

15.6 

19.4 

24.6 

31.2 

13.9 

140 

13.8 

16.1 

19.5 

24.2 

29.8 

22.3 

160 

12.3 

14.5 

17.6 

22.0 

27.2 

17.8 

200 

13.2 

15.5 

18.8 

23.3 

28.5 

14.5 

300 

17.3 

20.1 

24.4 

30.5 

38.5 

33.5 

350 

21.1 

24.5 

29.6 

36.9 

46.4 

36.5 

400 

25.1 

29.3 

36.0 

46.2 

— 

33.9 

500 

36.4 

42.7 

52.9 

— 

— 

49.4 


Table 8: Expected and observed 95% CL upper limits on tan /3 as a function of jwa in the 
fi = +200 GeV, benchmark scenario obtained from a combination of the 7 and 8 TeV data. 


Mass [GeV] 

-2(7 

— 1(7 

Median 

+ 1(7 

+2(7 

Observed 

100 

13.4 

16.0 

19.8 

25.1 

31.9 

14.2 

140 

13.8 

16.2 

19.6 

24.3 

30.1 

22.4 

160 

12.6 

14.8 

18.0 

22.4 

27.7 

18.2 

200 

13.5 

15.8 

19.2 

23.8 

29.1 

14.8 

300 

17.6 

20.5 

24.8 

31.1 

39.2 

34.1 

350 

21.4 

24.8 

30.0 

37.5 

47.1 

37.1 

400 

25.5 

29.8 

36.5 

46.9 

— 

34.4 

500 

36.8 

43.2 

53.5 

— 

— 

50.0 


Table 9: Expected and observed 95% CL upper limits on tan j6 as a function of OTa in the , 
ji = +200 GeV, benchmark scenario obtained from a combination of the 7 and 8 TeV data. 


Mass [GeV] 

-2(7 

— 1(7 

Median 

+ 1(7 

+2(7 

Observed 

100 

12.7 

15.0 

18.3 

22.8 

28.3 

13.4 

140 

13.1 

15.2 

18.1 

22.2 

26.9 

20.6 

160 

11.9 

13.9 

16.7 

20.5 

24.9 

16.9 

200 

12.8 

14.9 

17.9 

21.7 

26.1 

13.9 

300 

16.5 

18.9 

22.6 

27.7 

33.9 

30.1 

350 

19.8 

22.7 

26.9 

32.7 

39.7 

32.4 

400 

23.2 

26.7 

32.0 

39.7 

49.7 

30.4 

500 

32.3 

37.1 

44.4 

55.1 

— 

41.9 


Table 10: Expected and observed 95% CL upper limits on tan j6 as a function of jwa in the light- 
stau, = +200 GeV, benchmark scenario obtained from a combination of the 7 and 8 TeV data. 


Mass [GeV] 

-2(7 

— 1(7 

Median 

+ 1(7 

+2(7 

Observed 

100 

14.4 

17.4 

22.2 

29.1 

38.6 

15.4 

140 

15.0 

17.8 

22.0 

28.2 

36.2 

25.7 

160 

13.5 

16.1 

20.0 

25.6 

32.9 

20.2 

200 

14.4 

17.2 

21.4 

27.4 

34.8 

15.9 

300 

17.8 

21.6 

27.6 

36.7 

48.9 

41.4 

350 

21.0 

25.7 

33.1 

44.8 

— 

44.1 

400 

25.7 

31.8 

42.4 

— 

— 

39.0 

500 

40.3 

52.1 

— 

— 

— 

— 
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Appendix B Exclusion limits 


Table 11: Expected and observed 95% CL upper limits on tan j6 as a function of jwa in the light- 
stop, = -|-200 GeV, benchmark scenario obtained from a combination of the 7 and 8 TeV data. 


Mass [GeV] 

-2(7 

— 1(7 

Median 

-61(7 

+2(7 

Observed 

100 

15.3 

18.9 

24.7 

34.1 

49.2 

16.5 

140 

15.9 

19.1 

24.3 

32.6 

44.9 

29.1 

160 

14.4 

17.4 

22.1 

29.6 

40.2 

22.4 

200 

15.5 

18.8 

24.2 

32.3 

43.8 

17.3 

300 

19.7 

24.5 

32.7 

47.6 

— 

56.8 

350 

23.6 

29.9 

41.4 

— 

— 

— 

400 

29.7 

39.0 

58.6 

— 

— 

51.7 

500 

52.5 

— 

— 

— 

— 

— 


Table 12: Observed (expected) 95% CL upper limits on tan /3 as a function of niA in the 
benchmark scenario for different values of the higgsino mass parameter fi obtained from a 
combination of the 7 and 8 TeV data. 


Mass [GeV] 

^ = -500 GeV 

^ = -200 GeV 

^ = +200 GeV 

^ = +500 GeV 

100 

12.9 (16.6) 

13.7(18.1) 

14.2 (19.8) 

16.1 (22.7) 

140 

18.2 (16.4) 

19.9 (17.8) 

22.4 (19.6) 

26.1 (22.5) 

160 

15.5 (15.4) 

16.7 (16.6) 

18.2 (18.0) 

20.8 (20.6) 

200 

13.1 (16.2) 

14.0 (17.5) 

14.8 (19.2) 

16.6 (21.9) 

300 

24.2 (19.0) 

27.6 (21.3) 

34.1 (24.8) 

41.0 (27.8) 

350 

25.1 (21.3) 

29.7 (25.1) 

37.1 (30.0) 

43.8 (33.2) 

400 

23.5 (24.6) 

28.2 (29.5) 

34.4 (36.5) 

39.0 (42.2) 

500 

30.3 (31.8) 

37.8 (39.6) 

50.0 (53.5) 

-(-) 

600 

33.2 (41.0) 

42.3 (52.8) 

57.5 (-) 

-(-) 

700 

54.3 (51.3) 

-(-) 

-(-) 

-(-) 
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